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ABSTRACT 


BACKGROUND AND OBJECTIVES: Data mining is known as a process of discovering patterns in 
large datasets through a combination of statistical tools and techniques. In recent years, data 
mining and its applications in different businesses have increasingly grown. Insurance industry 
is one of the data-driven businesses whose survival is so dependent on satisfying customers 
besides achieving the highest benefit. Information or data is a vital asset of the insurance 
industry; accordingly, using data mining techniques to discover patterns behind large datase 
is a need. Having seen the increasingly high rate of information technology and recorded da 
in data-driven businesses, lots of industries like the insurance industry have been urged to us 
state-of-the-art data mining techniques to turn raw data into useful information using Big Da 
Analytics. 

(METHODS: Looking at the current research on data mining applications in the insurance 
industry proves the fact that we should recognize the state-of-the art techniques in data 
mining and set new strategies to focus on Big Data Analytics more. Big Data Analytics consists 
of the algorithms which are more efficient and less time-consuming so it can help to identify 
patterns and rules in complex datasets. For this purpose, this paper presents a comprehensive 
literature review regarding the usage of data mining techniques in the insurance industry by 
the scientometrics approach. For this purpose, first we searched and gathered bibliometrics 
files of recent researches from Web of Science and Scopus into four different scenarios. In each 
scenario, we looked up for different keywords regarding “Data Mining”, “Insurance Industry”, 
and “Risk Management” to make sure that all the results would be specifically focused on the 
research topic. Then, we used R programming software to analyze the results of each scenario 
based on keywords co-occurrence in the given research. 

FINDINGS: The results of keywords co-occurrence and a word cloud of recent research 
confirm that insurance companies should focus on Big Data Analytics instead of traditional 
data processing to get information systematically from too large or complex datasets. Big Data 
Analytics has been used for several years, but in recent years many data-driven businesses, 
like the insurance industry, have used its techniques associated with risk and risk factor 
identification. Risk management in the insurance industry has been widely considered in 
recent researches. Therefore, in this paper, some high-ranked journals and the most significant 
researches have been identified and recommended in order to pave the way for future 
researches in this field. 
CONCLUSION: We hope that the comprehensive literature review provided in this paper can help 
the researchers to focus on the relative journals and researches published then get into more 
details. For this purpose, the lists of all journals and conferences besides the most cited researches 
are provided in the experimental section of this paper. Also, the ranking list of different countries 
from all around the world related to data mining and Big Data Analytics in the insurance industry 
is presented. The results show that Iran is the 15th country that uses data mining techniques and 
it is the 17th country in the world focusing on risk management in the insurance industry. 
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Fig. 3: Cloud network of frequently used keywords in studies related to data mining in the insurance industry 
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Fig. 4: The growing trend of keywords related to data mining in the insurance industry 
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Table 3: Clustering based on co-occurrence network of keywords in data mining studies of insurance industry 
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Fig. 5: Co-occurrence network of keywords in data mining studies in the insurance industry 
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Table 4: Total and average citations to data mining researches in the insurance industry in different countries 
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Fig. 6: The sum of references to the researchers conducted in different countries based on the application of data mining in the 
insurance industry 
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Fig. 7: Co-occurrence network of words over time in data mining studies in the insurance industry 
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Table 5: The number of researches and total citations of articles published in reliable journals in the field of risk assessment in the 
insurance industry 
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Fig. 8: Total citations to researches published in reliable journals in the field of risk assessment in the insurance industry 
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Table 6: Total and average citations to research related to risk and its management in the insurance industry 
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Table 7: Total and average citation of researches related to the detection of violations in the insurance industry 
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Table 8: A selection of authoritative scientific journals in publishing articles related to the discovery of violations in the insurance 
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